make training and sweeps work on amd gpu hip/rocm #562

Open
sozforex wants to merge 2 commits into
PufferAI:4.0 from
sozforex:rocm_for_40
sozforex commented May 14, 2026

Allows training and sweeps to run on AMD GPUs (HIP/ROCm).
CUDA-specific code is hipified during the build and placed in build/hip/src.
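
To illustrate what "hipified" means here, the following is a toy sketch of a source-to-source translation step that rewrites a handful of CUDA runtime names to their HIP equivalents and mirrors the result into an output directory. It is not the PR's build logic: the helper names (`hipify_source`, `hipify_tree`), the file extensions, and the tiny rename table are assumptions made for illustration, and real builds would normally rely on the ROCm hipify tools, which cover far more of the API.

```python
import re
from pathlib import Path

# Small illustrative subset of CUDA -> HIP renames (assumption: the real
# build uses a full hipify tool with a much larger mapping).
CUDA_TO_HIP = {
    "cuda_runtime.h": "hip/hip_runtime.h",
    "cudaMalloc": "hipMalloc",
    "cudaMemcpy": "hipMemcpy",
    "cudaMemcpyHostToDevice": "hipMemcpyHostToDevice",
    "cudaFree": "hipFree",
    "cudaDeviceSynchronize": "hipDeviceSynchronize",
    "cudaStream_t": "hipStream_t",
}

# Match whole identifiers only, trying longer names first.
_PATTERN = re.compile(
    r"\b("
    + "|".join(re.escape(k) for k in sorted(CUDA_TO_HIP, key=len, reverse=True))
    + r")\b"
)


def hipify_source(text: str) -> str:
    """Return `text` with the known CUDA names replaced by HIP names."""
    return _PATTERN.sub(lambda m: CUDA_TO_HIP[m.group(1)], text)


def hipify_tree(src_dir: Path, out_dir: Path) -> list[Path]:
    """Translate source files under `src_dir` into a mirrored tree in `out_dir`."""
    written = []
    for path in sorted(src_dir.rglob("*")):
        # Hypothetical choice of extensions to translate.
        if path.is_file() and path.suffix in {".cu", ".cuh", ".h", ".cpp"}:
            target = out_dir / path.relative_to(src_dir)
            target.parent.mkdir(parents=True, exist_ok=True)
            target.write_text(hipify_source(path.read_text()))
            written.append(target)
    return written
```

Called as `hipify_tree(Path("src"), Path("build/hip/src"))`, this would leave the original CUDA sources untouched and write the translated copies under build/hip/src, which matches the layout described above.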

There was a problem passing sweep_obj between processes when the protein sweep runs on the GPU (apparently torch implements special synchronization handling for CUDA but not for ROCm), so this PR includes a change to address that.
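
The PR's actual change is not shown in this description. As a hedged sketch of one common way to avoid sharing GPU tensors across processes, the helper below copies every tensor-like leaf of a nested object to host memory before it is sent. The name `to_host` is hypothetical, and the assumption is that the object is built from dicts, lists, tuples, and tensors; the code only relies on the `detach()` and `cpu()` methods that `torch.Tensor` provides, so it is duck-typed and does not import torch itself.

```python
def to_host(obj):
    """Recursively copy tensor-like leaves of `obj` to host (CPU) memory.

    Anything exposing both `detach()` and `cpu()` (as torch.Tensor does) is
    treated as a tensor. Other leaves are returned unchanged.
    """
    if hasattr(obj, "detach") and hasattr(obj, "cpu"):
        return obj.detach().cpu()
    if isinstance(obj, dict):
        return {key: to_host(value) for key, value in obj.items()}
    if isinstance(obj, list):
        return [to_host(value) for value in obj]
    if isinstance(obj, tuple):
        # Plain tuples only; named tuples would need their own handling.
        return tuple(to_host(value) for value in obj)
    return obj
```

Under these assumptions, a producer would call something like `queue.put(to_host(sweep_obj))`, so that only CPU data crosses the process boundary and the receiver moves it back to the device if needed.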

Tested only with the breakout sweep / eval on an AMD GPU.

Mostly done with codex gpt-5.5.

@jsuarez5341 please let me know if you would like to merge AMD GPU support in a shape similar to this draft PR.

Support AMD GPU sweep execution, keep early-stop diagnostics out of persisted metrics, and add the HIP/CUDA shim needed for ROCm builds.
@sozforex sozforex marked this pull request as ready for review May 23, 2026 05:26